AITopics | individual iterate

Neural Information Processing Systems http://nips.cc/

Optimal Stochastic and Online Learning with Individual Iterates

Neural Information Processing Systems, Dec-25-2025, 05:21:45 GMT

Stochastic composite mirror descent (SCMD) is a simple and efficient method able to capture both geometric and composite structures of optimization problems in machine learning. Existing strategies require taking either an average or a random selection of iterates to achieve optimal convergence rates, which, however, can either destroy the sparsity of solutions or slow down the practical training speed. In this paper, we propose a theoretically sound strategy to select an individual iterate of the vanilla SCMD, which is able to achieve optimal rates for both convex and strongly convex problems in a non-smooth learning setting. This strategy of outputting an individual iterate can preserve the sparsity of solutions, which is crucial for a proper interpretation in sparse learning problems. We report experimental comparisons with several baseline methods to show the effectiveness of our method in achieving a fast training speed as well as in outputting sparse solutions.
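To make the sparsity point in the abstract concrete, here is a minimal sketch of the Euclidean special case of SCMD (stochastic proximal gradient descent) with an l1 regularizer. It is not the paper's iterate-selection strategy, which the abstract does not spell out. The hinge loss, the step-size schedule, and the names `soft_threshold` and `composite_sgd_l1` are assumptions chosen only for illustration. The sketch returns both the last iterate and the uniform average so their sparsity can be compared.

```python
import numpy as np


def soft_threshold(v, tau):
    """Proximal map of tau * ||.||_1: shrink each coordinate toward zero by tau."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)


def composite_sgd_l1(X, y, lam=0.05, T=1000, step0=0.5, seed=0):
    """Hypothetical helper: stochastic composite gradient descent on
    hinge loss + lam * ||w||_1 (Euclidean special case of mirror descent).

    Returns the last iterate and the uniform average of all iterates.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    w_avg = np.zeros(d)
    for t in range(1, T + 1):
        i = rng.integers(n)
        # Subgradient of the non-smooth hinge loss max(0, 1 - y_i * <x_i, w>).
        g = -y[i] * X[i] if y[i] * (X[i] @ w) < 1.0 else np.zeros(d)
        eta = step0 / np.sqrt(t)  # assumed step-size schedule
        # Gradient step on the loss, then exact proximal step on the regularizer.
        w = soft_threshold(w - eta * g, eta * lam)
        w_avg += (w - w_avg) / t  # running uniform average
    return w, w_avg
```

Comparing `np.count_nonzero(w)` with `np.count_nonzero(w_avg)` on synthetic data shows the effect the abstract describes: a coordinate of the average is exactly zero only if it was zero in essentially every iterate, so averaging tends to fill in the zeros that soft-thresholding produces.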

Optimal Stochastic and Online Learning with Individual Iterates

Yunwen Lei, Peng Yang, Ke Tang, Ding-Xuan Zhou

Neural Information Processing Systems, Oct-2-2025, 12:11:54 GMT

Reviews: Optimal Stochastic and Online Learning with Individual Iterates

Neural Information Processing Systems, Jan-22-2025, 18:15:04 GMT

This paper proposes an online stochastic optimization algorithm (similar to SGD) whose last iterate attains the optimal convergence rate in two settings: O(1/sqrt(T)) for Lipschitz convex functions and O(1/T) for strongly convex functions. Additionally, it allows an arbitrary non-smooth regularizer (e.g. ...). Many subsets of these properties are achieved by prior works. Namely, it was known how to achieve these results up to O(log T) factors. It was also known how to achieve the optimal rates with averaging, which, however, destroys sparsity. This paper gives the first algorithm that has all the properties simultaneously and removes the log factors. The paper has rigorous proofs of the convergence rates and extensive numerical experiments.
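The rates mentioned in the review can be laid out side by side. The summary below reads "up to O(log T) factors" as a multiplicative log T in the suboptimality bound after T rounds; that reading is an interpretation of the review, not a statement quoted from the paper.

```latex
\begin{align*}
\text{Lipschitz convex:} \quad
  & O\!\left(\frac{\log T}{\sqrt{T}}\right)
    \;\longrightarrow\;
    O\!\left(\frac{1}{\sqrt{T}}\right), \\
\text{strongly convex:} \quad
  & O\!\left(\frac{\log T}{T}\right)
    \;\longrightarrow\;
    O\!\left(\frac{1}{T}\right).
\end{align*}
```

The left-hand side of each row is the previously known rate without averaging, and the right-hand side is the rate the review attributes to the proposed algorithm.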

Optimal Stochastic and Online Learning with Individual Iterates

Neural Information Processing Systems, Oct-9-2024, 19:17:58 GMT

Optimal Stochastic and Online Learning with Individual Iterates

Yunwen Lei, Peng Yang, Ke Tang, Ding-Xuan Zhou

Neural Information Processing Systems, Mar-18-2020, 22:46:27 GMT

Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes

Ohad Shamir, Tong Zhang

arXiv.org Machine Learning, Dec-28-2012

Stochastic Gradient Descent (SGD) is one of the simplest and most popular stochastic optimization methods. While it has already been theoretically studied for decades, the classical analysis usually required non-trivial smoothness assumptions, which do not apply to many modern applications of SGD with non-smooth objective functions such as support vector machines. In this paper, we investigate the performance of SGD without such smoothness assumptions, as well as a running average scheme to convert the SGD iterates to a solution with optimal optimization accuracy. In this framework, we prove that after T rounds, the suboptimality of the last SGD iterate scales as O(log(T)/\sqrt{T}) for non-smooth convex objective functions, and O(log(T)/T) in the non-smooth strongly convex case. To the best of our knowledge, these are the first bounds of this kind, and almost match the minimax-optimal rates obtainable by appropriate averaging schemes. We also propose a new and simple averaging scheme, which not only attains optimal rates, but can also be easily computed on-the-fly (in contrast, the suffix averaging scheme proposed in Rakhlin et al. (2011) is not as simple to implement). Finally, we provide some experimental illustrations.
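As an illustration of what "computed on-the-fly" can mean for an averaging scheme, the sketch below keeps a single running weighted average and updates it as each iterate arrives, without storing past iterates. The weight (eta + 1) / (t + eta) is recalled as the polynomial-decay form and should be treated as an assumption here, since the abstract does not state the scheme. The function name `polynomial_decay_average` and the default `eta=3.0` are hypothetical.

```python
import numpy as np


def polynomial_decay_average(iterates, eta=3.0):
    """Running weighted average of SGD iterates, updated one iterate at a time.

    Assumed weighting: the t-th iterate enters with weight (eta + 1) / (t + eta).
    With eta = 0 this reduces to the plain uniform average; larger eta puts
    more weight on recent iterates. Returns None for an empty input.
    """
    avg = None
    for t, w in enumerate(iterates, start=1):
        w = np.asarray(w, dtype=float)
        weight = (eta + 1.0) / (t + eta)  # equals 1 at t = 1
        if avg is None:
            avg = w.copy()
        else:
            avg = (1.0 - weight) * avg + weight * w
    return avg
```

Because the update needs only the previous average and the current iterate, it costs O(d) memory for d-dimensional iterates and no knowledge of the total number of rounds T in advance.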

1212.1824
